Self-Supervised Visual Planning with Temporal Skip Connections
نویسندگان
چکیده
In order to autonomously learn wide repertoires of complex skills, robots must be able to learn from their own autonomously collected data, without human supervision. One learning signal that is always available for autonomously collected data is prediction. If a robot can learn to predict the future, it can use this predictive model to take actions to produce desired outcomes, such as moving an object to a particular location. However, in complex open-world scenarios, designing a representation for prediction is difficult. In this work, we instead aim to enable self-supervised robot learning through direct video prediction: instead of attempting to design a good representation, we directly predict what the robot will see next, and then use this model to achieve desired goals. A key challenge in video prediction for robotic manipulation is handling complex spatial arrangements such as occlusions. To that end, we introduce a video prediction model that can keep track of objects through occlusion by incorporating temporal skipconnections. Together with a novel planning criterion and action space formulation, we demonstrate that this model substantially outperforms prior work on video prediction-based control. Our results show manipulation of objects not seen during training, handling multiple objects, and pushing objects around obstructions. These results represent a significant advance in the range and complexity of skills that can be performed entirely with self-supervised robot learning.
منابع مشابه
Path Planning for Robot Navigation using View Sequences
Navigation based on visual memories is very common among humans. However, planning long trips requires also a more sophisticated representation of the environment, such as a topological map. This paper describes a system that learns paths by storing sequences of images and image information in a Sparse Distributed Memory. Connections between paths are detected by exploring similarities in the i...
متن کاملSelf-Stabilizing Supervised Publish-Subscribe Systems
In this paper we present two major results: First, we introduce the first self-stabilizing version of a supervised overlay network (as introduced in [13]) by presenting a self-stabilizing supervised skip ring. Secondly, we show how to use the self-stabilizing supervised skip ring to construct an efficient self-stabilizing publish-subscribe system. That is, in addition to stabilizing the overlay...
متن کاملMulti-Temporal Assessment of Mangrove Forests Change in the Coastal Areas of Bushehr Region Based on Landsat Satellite Imagery
Continual access to precise information about the land use/land cover (LULC) changes of the Earth’s surface is extremely important for any sustainable development program in which LULC serves as one of the major input criteria. In this study, a supervised classification was applied to three Landsat images collected in 1986, 1998and 2018, providing mangrove forests change data in the coastal are...
متن کاملNavigating Through Temporal Difference
Barto, Sutton and Watkins [2] introduced a grid task as a didactic example of temporal difference planning and asynchronous dynamical pre>gramming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.
متن کاملAn Empirical Exploration of Skip Connections for Sequential Tagging
In this paper, we empirically explore the effects of various kinds of skip connections in stacked bidirectional LSTMs for sequential tagging. We investigate three kinds of skip connections connecting to LSTM cells: (a) skip connections to the gates, (b) skip connections to the internal states and (c) skip connections to the cell outputs. We present comprehensive experiments showing that skip co...
متن کامل